ASR メモ
2025/11/12
色々きてる
aiola/drax-v1 · Hugging Face
Omnilingual ASR Media Transcription - a Hugging Face Space by facebook
Models | ElevenLabs Documentation
2025/10/22
Looking for diarization model better than Pyannote : r/LocalLLaMA
MahmoudAshraf97/whisper-diarization: Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
nvidia/diar_sortformer_4spk-v1 · Hugging Face
pyannote/speaker-diarization-community-1 · Hugging Face
pyannote 4.0
nvidia/diar_sortformer_4spk-v1 · Hugging Face
nvidia/diar_streaming_sortformer_4spk-v2 · Hugging Face
Open ASR Leaderboard - a Hugging Face Space by hf-audio
LLMに音声を聴かせる🎧 → 🧠 →📝 〜 LLM Based ASR〜
話者分離と音声認識 (定番のpyannote.audioでなくNeMoのdiarization modelを利用)+日本語化
GPU-optimized AI, Machine Learning, & HPC Software | NVIDIA NGC
#ASR